|
|
Accession Number |
TCMCG075C05680 |
gbkey |
CDS |
Protein Id |
XP_017971799.1 |
Location |
complement(join(5541700..5541768,5542629..5542689,5543499..5544076,5544474..5544547,5544634..5544930,5545257..5545358,5545538..5545606,5546330..5546402,5546663..5546746,5546896..5546982,5547079..5547138,5547719..5547811,5548041..5548160,5549220..5549423,5549516..5549566,5549928..5550016,5550116..5550248,5550456..5550599,5551078..5551592,5551846..5551922,5552043..5552158,5552272..5552413,5552509..5552717)) |
Gene |
LOC18608025 |
GeneID |
18608025 |
Organism |
Theobroma cacao |
|
|
Length |
1148aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018116310.1
|
Definition |
PREDICTED: DNA repair protein REV1 isoform X1 [Theobroma cacao] |
CDS: ATGAGTCTGGATTCTTCTCGCTCTGCGAATTCAGGCCCCCAGAATTCGAAAAGAAGCTTCAATTCAAATTCTTCAAACAATAAGGATAACAGCAGTAATAGCAAGAAGAGAAAAAGTAACCAGAAAACCCTAGGCATGGCTTGGGGCGCCAATTCTCTCTCTACCTCTCGCTCCTCTTTCCGCTCTTCTCCTTTCTCTGATTTTGGAAGTTATATGGTAGAAAAGAATAGAAAGCTTCAAAATCAGTTTGATGCCGAGGCTTCAAATTCTTCTCTTAGCGATACTTCTACGAAGCCTATATTTCGCGGGGTTTCTATCTTCGTTGATGGTTTCACGGTTCCGTCTAGTCAGGAACTGCGGCGATACATGTTGAATTATGGTGGACGATTCGAGAATTATTTTTCTAGGCATCGAGTCACGCATATTATCTGCAGCAATCTCCCTGATAGTAAAATCAAGAATATCAGGTCCTTCAGCGGTGGACTGCCAGTAGTGAAACCTACGTGGGTTCTAGATTCTGTTGCTGTTAACAGACTTTTGAGTTGGGTTCCCTACCAGCTTGACCAGCTTGCTAGCAATCAACCAACATTGTCAACCTTCTTCACTTCAAAAATCAGCCCTGCATCTGAGGGCGTTTTTGCAGATGCAATTTGTGAAGTAAAACATGGAACTGAGGATTTATGTTTAAAGGATGCATCAAAGGACGCAAAATTCTCTGAAGCAGGTGAGCCCTCTGAATGGAGGAAGAAAATTACTGAAGAACATGATGAACTTATGCATGGAAATACTAATTCAAAAGTAATTGAGGAGCCAAGTAGTAGCTATAGTGAAGCATCTCAGGAAGTAAAAGTGGTAGAACGAAGTAATCTGGAACAAGATGATGAAAGCAGGGAAAACAATAGACCTCAGTACTGTCCTGAACAACCCTCTGCTTCTGTTAGTAGCCACTGCTTTGACAATCACAGTGTAAAAGGATCGCCCCATTCAACAGCCCTTGGACCTTTGAAGCAGTGTCATTCAACTCTTGGAGATCCCAATTTTGTGGAGAACTACTTCAAGAATTCAAGGCTGCATTTCATAGGAACCTGGAGAAATAGATATCGTAAGCGTTTTCCCAGCTTGCCAAATGGGTTCAAGTGCATGAATTCTCATTCGGATGTTTCGGCTGATACTCAGAAGACTGCCATTATACATATTGATATGGACTGTTTTTTCGTCTCAGTGGTCATCAGGAGCCATCCTGAATTACATGACAAGCCTGTAGCTGTATGCCATTCGGACAATCCAAAAGGAACTGCTGAAATCTCTTCTGCCAATTATCCTGCTCGAGATTATGGAATTAGGGCAGGAATGTTTGTTAGAGATGCCAAGGCACTTTGCACCCACCTTGTTATTCTCCCATACAACTTTGAAGCATATGAGGAGGTTGCTGATCAGTTTTATAACATCTTGCATAAGTACTGCAACAGAGTTCAGGCTGTCAGCTGTGATGAAGCATTTTTAGATGTCACAGACTTAGAAGGGGAAGATCCTAAGCTTTTAGCTTCAGCAATACGGAAAGAGATATTTGAAGCTACTGGATGCACTGCAAGTGCTGGGATAGCTGTGAATATGCTTATGGCCCGTCTAGCCACCAGAACTGCTAAACCAAATGGTCAATGCTACATTTCTCCTGAGAGGGTTGATGAGTATTTAGACCAACTTCCATTAAAAGCACTTCCAGGAATAGGGCATGTGCTAGAGGAAAAGTTGAAAAATAGAAATGTTAGAACTTGTGGACAGTTGCGTATGATTTCTAAGGGCTCCCTTCAAAAGGATTTTGGGTTTAAAACTGGTGAGATGCTCTGGAATTACAGTAGAGGAGTGGATAATCGACTTGTTGGAACAATTCAGGAGAGCAAGTCTGTGGGGGCTGAAGTGAACTGGGGTGTAAGATTCAGGGATTTGCAAGATACCCAGCACTTTCTCTTGGACCTTTGCAAAGAGGTTTCATTGCGCTTGCAGGGGTGTGGGGTGCAAGGGCGAACTTTCACACTTAAGATAAAAAAGAGAAGGAAAGATGCTGGGGAGCCTGCAAAGTATATGGGCTGTGGAGACTGTGAAAACCTGAGCCATAGCACAACGGTTCCACTTGCCACTGATGATGTCGAAGTGCTTCAGAGAATTACAAAGCAGCTCTTTGGATTTTTCCACGTAGATGTCAAGGATATCCGGGGTGTTGGTTTGCAAGTTTCGAGGCTTGAAAGCGTAGATACTTCTAAGCAAGTGCTTGAGAGGAATTCGTTGAAATCATGGCTTATGTCTGCCTCAGCAAGCTCAGAAGAACGATGTGATGTCAGTAGTATAGCCAAAGACAGGGTTGGTACAGATACTGAAGGAAAGAGCATGGGTGGAAATTCAGGTGTGTTATGCACTGATCCAGTGGGGAATTCTGTTCTTAGGACAAATAATACATCCAATGGTGATGGTTGTTCAAACCAGATCTTAAGCATCCCACAGTTATGCCACCTCGATATGGGAGTAGTGGAGAGTCTTCCATCAGAGCTCCAGTCAGAATTAAATGAAATGTATGGTGGGAAGCTAGTTGATTTGATTGCTAAAAGTAAAGGACAAGGTGAGAACAGCACCGGTTCTTTATGCTTCCATCCTCCTGAACTATCCAAAGTTGCAATAGAGGAAGCAGAAAGATCTCACAATTCTGATCCTATCTCATTGAGCAGAACAGCTGTGGAAATGATGGGCAAACAGCATATATTGGAGGAACTGCAGACAGTGCCTGACTCTGGGACTGGATCCAACAGTAATGCTATTTCCATTCAAGCACTTGATAATAATGATTTAATGCCTTCATCTCTAAGCCAAGTAGACACATCAGTGTTACAGCAGTTGCCTGAGGAATTGAGAGCTGACTTATTTGAGTCGCTTCCTGCACACAGGAGGCAAGAAATCTCTACCCTGGGCCCTAATAGGGATAATTTGCATCATCCATTATGCATCAATCAACCTGAATCAACTGATTCTGGGCTGACCAACAATCTCTGGATTGGAAATCCTCCACTGTGGGTTGATAAGTTTAAAGTCAGCAATTTGTTGATGTTGAGATTTTTTGCTGACATGTACTACAAATCAAAGTCAGCTGAGAATTTATCTTCAATTCTGCAATGCACTATTGCTGAATCTTTACATCCTTTAGATGCAAAATGTGACGCTTGGAATGAAGCTGTTCACAGCTTCAATGAGCTTCTCATGGAGTATATTAAACTGAAGATAGTAGTAGATATTGAGGAGATCTATGTTTGTTTTCGTCTTCTAAGAAGGCTAAGTACCAAGTCAGAATTTTTCTTGGAAGTGTACAATTTGGTCTTCCCCCACCTTCAGGATTTATGCTTTGGAAAATTAAACACATTTTTAGCACTGGAGTTTTCAATTTTAACTTCTTTACATTAA |
Protein: MSLDSSRSANSGPQNSKRSFNSNSSNNKDNSSNSKKRKSNQKTLGMAWGANSLSTSRSSFRSSPFSDFGSYMVEKNRKLQNQFDAEASNSSLSDTSTKPIFRGVSIFVDGFTVPSSQELRRYMLNYGGRFENYFSRHRVTHIICSNLPDSKIKNIRSFSGGLPVVKPTWVLDSVAVNRLLSWVPYQLDQLASNQPTLSTFFTSKISPASEGVFADAICEVKHGTEDLCLKDASKDAKFSEAGEPSEWRKKITEEHDELMHGNTNSKVIEEPSSSYSEASQEVKVVERSNLEQDDESRENNRPQYCPEQPSASVSSHCFDNHSVKGSPHSTALGPLKQCHSTLGDPNFVENYFKNSRLHFIGTWRNRYRKRFPSLPNGFKCMNSHSDVSADTQKTAIIHIDMDCFFVSVVIRSHPELHDKPVAVCHSDNPKGTAEISSANYPARDYGIRAGMFVRDAKALCTHLVILPYNFEAYEEVADQFYNILHKYCNRVQAVSCDEAFLDVTDLEGEDPKLLASAIRKEIFEATGCTASAGIAVNMLMARLATRTAKPNGQCYISPERVDEYLDQLPLKALPGIGHVLEEKLKNRNVRTCGQLRMISKGSLQKDFGFKTGEMLWNYSRGVDNRLVGTIQESKSVGAEVNWGVRFRDLQDTQHFLLDLCKEVSLRLQGCGVQGRTFTLKIKKRRKDAGEPAKYMGCGDCENLSHSTTVPLATDDVEVLQRITKQLFGFFHVDVKDIRGVGLQVSRLESVDTSKQVLERNSLKSWLMSASASSEERCDVSSIAKDRVGTDTEGKSMGGNSGVLCTDPVGNSVLRTNNTSNGDGCSNQILSIPQLCHLDMGVVESLPSELQSELNEMYGGKLVDLIAKSKGQGENSTGSLCFHPPELSKVAIEEAERSHNSDPISLSRTAVEMMGKQHILEELQTVPDSGTGSNSNAISIQALDNNDLMPSSLSQVDTSVLQQLPEELRADLFESLPAHRRQEISTLGPNRDNLHHPLCINQPESTDSGLTNNLWIGNPPLWVDKFKVSNLLMLRFFADMYYKSKSAENLSSILQCTIAESLHPLDAKCDAWNEAVHSFNELLMEYIKLKIVVDIEEIYVCFRLLRRLSTKSEFFLEVYNLVFPHLQDLCFGKLNTFLALEFSILTSLH |